SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT : LI, ET AL . 

(ii) TITLE OF INVENTION: Neurotransmitter Transporter 



(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

CECCHI, STEWART & OLSTEIN 

(B) STREET: 6 BECKER FARM ROAD 

(C) CITY: ROSELAND 

(D) STATE: NEW JERSEY 

(E) COUNTRY: USA 

(F) ZIP: 07068 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 INCH DISKETTE 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: WORD PERFECT 5.1 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: SUBMITTED HEREWITH 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) 



ATTORNEY /AGENT INFORMATION: 
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(A) NAME: FERRARO, GREGORY D. 

(B) REGISTRATION NUMBER: 36,134 

(C) REFERENCE /DOCKET NUMBER: 325800-118 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 2,486 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 
P (D) TOPOLOGY: LINEAR 

fy (ii) MOLECULE TYPE: cDNA 





<xi) 


SEQUENCE 


DESCRIPTION: SEQ 


ID NO:l: 






s; 

P 


CGGAGG CAGG 


GAGTGAGGAG 


CGAG CGGAGT 


CGCGTGCGCC 


GG CG CGAG CT 


CCGGGTCGCC 


60 




CCAGCCCCAG 


CCGGGGGCCT 


GTGG CGGGGG 


AGGAG CTGTG 


CGTCCGCGAC 


CCGTCGGGA 


12 0 


n 


TCGCAGCTGC 


TCGGCCGGAG 


TGCACGGGCC 


GAGTCTGCGC 


GACTACCCAC 


GCGTGACAGG 


180 




TCCCTGAATG 


AGAAGGAGCT 


GACAGCAGCT 


GAATTCCATC 


TTCTCTGTGT 


GCTGGGGAGC 


240 


a 


AGGGCTACAC 


GGCCCAGGTG 


GCATCAATGC 


CGAAGAACAG 


CAAAGTGACC 


CAGCGTGAGC 


300 




ACAG CAGTGA 


GCATGTCACT 


GAGTCCGTGG 


CCGACCTGCT 


GGCCCTCGAG 


GAGCCTGTGG 


360 




ACTATAAGCA 


GAGTGTACTG 


AATGTGG CTG 


GTGAGGCAGG 


CGG CAAGCAG 


AAGGCGGTGG 


420 




AGGAGGAGCT 


GGATG CAGAG 


GACCGGCCGG 


CCTGGAACAG 


TAAGCTGCAG 


TACATCCTGG 


480 




CCCAGATTGG 


CTTCTCTGTG 


GGCCTCGGCA 


ACAT CTGGAG 


GTTCCCCTAC 


CTGTG CCAGA 


540 




AAAATGGAGG 


AGGTG CTTAC 


CTGGTGCCCT 


AC CTGGTG CT 


G CTGAT CAT C 


ATCGGGATCC 


600 




CCCTCTTCTT 


C CTGGAG CTG 


GCTGTGGGTC 


AGAGGAT CCG 


CCGCGGAAGC 


ATCGGTGTGT 


660 




GGCACTATAT 


ATGTCCCCGC 


CTGGGGGGGA 


TCGGCTTCTC 


CAGCTGCATA 


GT CTGT CT CT 


720 




TTGTGGGGCT 


GTATTATAAT 


GTGAT CAT CG 


GGTGGAG CAT 


CTTCTATTTC 


TTCAAGTCCT 


780 




TCCAGTACCC 


GCTGCCCTGG 


AGTGAATGTC 


CTGT CGTCAG 


GAATGGGAGC 


GTCGCAGTGG 


840 




TGGAGG CAGA 


GTGTGAAAAG 


AGCTCAGCCA 


CTAC CTACTT 


CTGGTACCGA 


GAGGCTTTGG 


900 




ACATCTCTGA 


CTCCATCTCG 


GAGAGTGGGG 


GCCTCAACTG 


GAAGATGACC 


CTGTGCCTCC 


960 




TCGTGGTCTG 


GAG CAT CGGG 


GGGATGG CTG 


TCGGTAAGGG 


CATCCAGTCC 


TCGGGGAAGG 


1020 
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rn/^» 71 T'fl'T' R ' 1 " 1 * 1 1 
iuAlblAl x 1 


pagptccctc 

» X WWW X W 


TTCCCCTACG 




Tbl ILjI XVJ wo- 


a nnnn pagtt 

AUUUVJ w^-\w X x 


GATGGCATCC 




rn/-* i^rpr 1 A /T^P 

TG CT w<iAw \- w 


p p AfwrnTPrG 

w 1 vJ 1 ww 


CGGGAGGTAG 




GLITivw 1 oij 




tt PT P CAGTT 

X X W X W W*"»W X X 




ATGG CG C C CT 


IjwJ lul w w X J. w 


X V_rvri\_ X X w X 




TTGTTGTTTT 


r'PPPTTPAAH 


GPPAACAT PA 

WJWWU^W-T^X 




AGAAAAT L w 1 


Auuu InUV. X i> 


AAPAPPAACG 




TCAAw u LIU 


V_ v_ X W 


ACAAAGGACT 




TGAAtjrCjAvjviA 


V, \ M.w X X X w-r* 


GCCCTGGGCC 

w: w w w x w^w/w w w 




AwAAtj i v_v_Li x 




GGCCTGGCCT 




TCCLv-ALL 1 *_ 


p rrmr ptgg 

V_ W WW X 1V.1 WW 


TCCGTCATGT 




\j L-Avj wA 1 uA 1 




GCAGGCATCA 




L CAAvjvjAIjA x 


Oil X w 


GG PTG CTGTG 




TLwALjwLjw x l_ 




TTTGT CAC CA 




TCAL 1 V- 1 LAI 


WW X WJ"*X WW x X 


GAGAACAT CG 




rn /""» A r P/^' A fT"' A 

T C_A 1 Lt LAuuA 




ATG CTGGG CT 




f~*t~* a A /^"T" 1 THT 
LjCxAAvj 1 1 v_vj 1 


HT PT P PAPTA 

Vj X w X w w** W X 


TGCATGGCTG 


5 : 




p p pgg p ptac 

WW www w w X w 


AGCGCCTGGA 




AH I LLLLAA 


prr; ci p p p RTR 
w X www wwJ-lx w 


GCACTCCTGA 

VJ V^tt. X W W X W^k 




I iLLivjl waj X 


^tt PP^T P PTG 

w X X WW X W W X W 


CGGCACTTCC 


fy 


LLulbitLiA 


PAAGAAGGCC 


CG CATGATGA 




AGACCCGCTT 


-7» i-r> /~> ( *>Ti/^A/*^f^ 

CATCCTLAuL 


a. ^rin^rn p p P a 




GTTCCTATCT 


GGGGCCCGGC 


AG CACAT CAC 




C CTATGGG CG 


CGGCTACCTG 


CTGGCCAGCA 




AGCCCATGCC 


CGCTCTCCCC 


CCACCG 




TGGTGCTGGC 


CTGCTTCCTG 


GTCCGGGGGT 


1080 


TACACATGTT 


CACTCCCAAG 


CTGGTCAAGA 


1140 


CTACCCAGGT 


CTTCTTTGGC 


TTGGGTCTGG 


1200 


ACAATAAG CA 


GGACAACAAC 


TGCCACTTCG 


1260 


TCACGTCAGT 


GTTGGCCACC 


CTCGTGGTGT 


1320 


TGAATGAGAA 


GTGTGTGGTC 


GAGAATGCTG 


1380 


TCCTGAGCCG 


GGACCTCATC 


CCACCCCACG 


1440 


ACATGGAGAT 


GGACAATGTC 


AT CATGACCG 


1500 


TTGACCCCTG 


CCTTCTGGAG 


GACGAG CTGG 


1560 


TCATCGCCTT 


CACTGAGGCC 


ATGACG CACT 


1620 


TCTTCTTGAT 


GCTTATCAAC 


CTGGG C CTGG 


1680 


CCACGCCCAT 


CATCGACACC 


TCCAAGGTGC 


1740 


TCTTTACATT 


CCTCGTGGGA 


CTGTTGTTCG 


1800 


TGTTCGATGA 


CTACTCAGCC 


ACGCTGCCAC 


1860 


CTGTGGCCTG 


GATTTATGGA 


CCCAAGAAGT 


1920 


TCCGCCCCTA 


CCGCTTCTAT 


TTCTACATGT 


1980 


TGCTCACCAC 


AGCCAGCATC 


AT C CAG CTGG 


2040 


TCAAGGAGGA 


GGCTGCCGAG 


CGCTACCTGT 


2100 


TCACCCTCAT 


CGTCGTGGCG 


ACGCTGCCCA 


2160 


ACCTGCTCTC 


TGATGGCTCC 


AACACCCTCT 


2220 


AGGACAT CT C 


CAACCTGGAG 


GAGAACGATG 


2280 


GTGAGG CAC C 


TTLLLLLAIu 


LLwHL X w>*.w w 


At J t VJ 


CCCTGGAGAC 


CAGCTGGAAC 


CCCAATGGAC 


2400 


CCCCTGAGTC 


TGAGCTGTGA 


CCACTGCCCA 


2460 
2486 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 727 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS : 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Pro Lys Asn Ser Lys Val Thr Gin Arg Glu His Ser Ser Glu 

5 10 15 
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His Val Thr Glu Ser Val Ala Asp Leu Leu Ala Leu Glu Glu Pro 

20 25 30 

Val Asp Tyr Lys Gin Ser Val Leu Asn Val Ala Gly Glu Ala Gly 

35 40 45 

Gly Lys Gin Lys Ala Val Glu Glu Glu Leu Asp Ala Glu Asp Arg 

50 55 60 

Pro Ala Trp Asn Ser Lys Leu Gin Tyr lie Leu Ala Gin lie Gly 

65 70 75 

Phe Ser Val Gly Leu Gly Asn lie Trp Arg Phe Pro Tyr Leu Cys 

80 85 90 

Gin Lys Asn Gly Gly Gly Ala Tyr Lys Val Pro Tyr Leu Val Leu 

95 100 105 

Leu lie He He Gly He Pro Leu Phe Phe Leu Glu Leu Ala Val 

110 115 120 

Gly Gin Arg He Arg Arg Gly Ser He Gly Val Trp His Tyr lie 

125 130 135 

Cys Pro Arg Leu Gly Gly He Gly Phe Ser Ser Cys He Val Cys 

140 145 150 

Leu Phe Val Gly Leu Tyr Tyr Asn Val He He Gly Trp Ser He 

155 160 165 

Phe Tyr Phe Phe Lys Ser Phe Gin Tyr Pro Leu Pro Trp Ser Glu 

170 175 180 

Cys Pro Val Val Arg Asn Glu Ser Val Ala Val Val Glu Ala Glu 

185 190 195 

Cys Glu Lys Ser Ser Ala Thr Thr Tyr Phe Trp Tyr Arg Glu Ala 

200 205 210 

Leu Asp He Ser Asp Ser He Ser Glu Ser Gly Gly Leu Asn Trp 

215 220 225 

Lys Met Thr Leu Cys Leu Leu Val Val Trp Ser lie Gly Gly Met 

230 235 240 

Ala Val Gly Lys Gly He Gin Ser Ser Gly Lys Val Met Tyr Phe 

245 250 255 

Ser Ser Leu Phe Pro Tyr Val Val Leu Ala Cys Phe Leu Val Arg 

260 265 270 



Gly Leu Leu 
Thr Pro Lys 
Val Ala Thr 
val lie Val 
Phe Asp Gly 
Leu Ala Thr 
lie Met Asn 
Gly Tyr Leu 
His Val Asn 
Asp Asn Val 
Gly Leu Asp 
Gin Gly Thr 
His Phe Pro 
Leu lie Asn 
lie Thr Thr 
Phe Thr Val 
Phe Val Gin 
Tyr Ser Ala 



Leu Arg Gly 

275 
Leu Val Lys 

290 
Gin Val Phe 

305 

Phe Ser Ser 
320 

Ala Leu Val 

335 
Leu Val Val 

350 
Glu Lys Cys 

365 

Asn Thr Asn 

380 
Phe Ser His 

395 
lie Met Thr 

410 
Pro Cys Leu 

425 
Gly Leu Ala 

440 
Thr Ser Pro 

455 
Leu Gly Leu 

470 
Pro lie lie 

485 
Gly Cys Cys 

500 

Arg Ser Gly 
515 

Thr Leu Pro 



Ala Val Asp 
Met Leu Asp 
Phe Gly Leu 
Tyr Asn Lys 
Ser Phe lie 
Phe Val Val 
Val Val Glu 
Val Leu Ser 
Leu Thr Thr 
Val Lys Glu 
Leu Glu Asp 
Phe lie Ala 
Phe Trp Ser 
Gly Ser Met 
Asp Thr Ser 
Val Phe Thr 
Asn Tyr Phe 
Leu Thr Leu 



Gly lie Leu 
280 

Pro Gin Val 
295 

Gly Leu Gly 
310 

Gin Asp Asn 
325 

Asn Phe Phe 
340 

Leu Gly Phe 
355 

Asn Ala Glu 
370 

Arg Asp Leu 
385 

Lys Asp Tyr 
400 

Asp Gin Phe 
415 

Glu Leu Asp 
430 

Phe Thr Glu 
445 

Val Met Phe 
460 

lie Gly Thr 
475 

Lys Val Pro 
490 

Phe Leu Val 
505 

Val Thr Met 
520 

He Val He 



His Met Phe 
285 

Trp Arg Glu 
300 

Phe Gly Gly 
315 

Asn Cys His 
330 

Thr Ser Val 
345 

Lys Ala Asn 
360 

Lys He Leu 
375 

lie Pro Pro 
390 

Met Glu Met 
405 

Ser Ala Leu 
420 

Lys Ser Val 
435 

Ala Met Thr 
450 

Phe Leu Met 
465 

Met Ala Gly 
480 

Lys Glu Met 
495 

Gly Leu Leu 
510 

Phe Asp Asp 
525 

Leu Glu Asn 



530 535 540 

lie Ala Val Ala Trp lie Tyr Gly Pro Lys Lys Phe Met Gin Glu 

545 550 555 

Leu Thr Glu Met Leu Gly Phe Arg Pro Tyr Arg Phe Tyr Phe Tyr 

560 565 570 

Met Trp Lys Phe Val Ser Pro Leu Cys Met Ala Val Leu Thr Thr 

575 580 585 

Ala Ser lie lie Gin Leu Gly Val Thr Pro Pro Ala Tyr Ser Ala 

590 595 600 

Trp lie Lys Glu Glu Ala Ala Glu Arg Tyr Leu Tyr Phe Pro Asn 

605 610 615 

Trp Pro Met Ala Leu Leu lie Thr Leu lie Val Val Ala Thr Leu 

620 625 630 

Pro lie Pro Val Val Phe Val Leu Arg His Phe His Leu Leu Ser 

p 635 640 - 645 

Asp Gly Ser Asn Thr Leu Ser Val Ser Tyr Lys Lys Ala Arg Met 

650 655 660 
Met Lys Asp lie Ser Asn Leu Glu Glu Asn Asp Glu Thr Arg Phe 

665 670 675 
lie Leu Ser Lys Val Pro Ser Glu Ala Pro Ser Pro Met Pro Thr 

680 685 690 
His Arg Ser Tyr Leu Gly Pro Gly Ser Thr Ser Pro Leu Glu Thr 

695 700 705 
Ser Trp Asn Pro Asn Gly Pro Tyr Gly Arg Gly Tyr Leu Leu Ala 

710 715 720 
Ser Thr Pro Glu Ser Glu Leu 

725 



ft xi 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: LI , YI 

FLEISCHMANN, ROBERT 

(ii) TITLE OF INVENTION: NEUROTRANSMITTER TRANSPORTER 
(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 

(D) STATE: MD 

(E) COUNTRY: US 

(F) ZIP: 20850 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1-0, Version #1.3 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/424,424 

(B) FILING DATE: 21-APR-1995 

( C ) CLASSIFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Michele M. Wales 

(B) REGISTRATION NUMBER: 43,975 
<C) REFERENCE /DOCKET NUMBER: PF116 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: (301) 309-8439 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2485 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 266.-2446 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CGGAGGCAGG GAGTGAGGAG CGAGCGGAGT CGCGTGCGCC GGCGCGAGCT CCGGGTCGCC 
CCAGCCCCAG CCGGGGGCCT GTGGCGGGGG AGGAGCTGTG CGTCCGCGAC CCGTCGGGAT 
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CGCAGCTGCT CGGCCGGAGT GCACGGGCCG AGTCTGCGCG ACTACCCACG CGTGACAGGT 180 

CCCTGAATGA GAAGGAGCTG ACAGCAGCTG AATTCCATCT TCTCTGTGTG CTGGGGAGCA 240 

GGGCTACACG GCCCAGGTGG CATCA ATG CCG AAG AAC AGC AAA GTG ACC CAG 292 

Met Pro Lys Asn Ser Lys Val Thr Gin 
1 5 



CGT GAG CAC AGC ACT GAG CAT GTC ACT GAG TCC GTG GCC GAC CTG CTG 340 
Arg Glu His Ser Ser Glu His Val Thr Glu Ser Val Ala Asp Leu Leu 
10 15 20 25 

GCC CTC GAG GAG CCT GTG GAC TAT AAG CAG AGT GTA CTG AAT GTG GCT 388 
Ala Leu Glu Glu Pro Val Asp Tyr Lys Gin Ser Val Leu Asn Val Ala 
30 -35 40 

GGT GAG GCA GGC GGC AAG CAG AAG GCG GTG GAG GAG' GAG CTG GAT GCA 43 6 

Gly Glu Ala Gly Gly Lys Gin Lys Ala Val Glu Glu Glu Leu Asp Ala 
45 50 55 

GAG GAC CGG CCG GCC TGG AAC AGT AAG CTG CAG TAC ATC CTG GCC CAG 
Glu Asp Arg Pro Ala Trp Asn Ser Lys Leu Gin Tyr lie Leu Ala Gin 
60 65 70 

ATT GGC TTC TCT GTG GGC CTC GGC AAC ATC TGG AGG TTC CCC TAC CTG 
He Gly Phe Ser Val Gly Leu Gly Asn He Trp Arg Phe Pro Tyr Leu 
75 80 85 



TGC CAG AAA AAT GGA GGA GGT GCT TAC CTG GTG CCC TAC CTG GTG CTG 
Cys Gin Lys Asn Gly Gly Gly Ala Tyr Leu Val Pro Tyr Leu Val Leu 
90 . 95 100 105 

CTG ATC ATC ATC GGG ATC CCC CTC TTC TTC -CTG GAG CTG GCT GTG GGT 
Leu He He He Gly He Pro Leu Phe Phe Leu Glu Leu Ala Val Gly 
110 .115 120 

CAG AGG ATC CGC CGC GGA AGC ATC GGT GTG TGG CAC TAT ATA TGT CCC 
Gin Arg He Arg Arg Gly Ser He Gly Val Trp His Tyr lie Cys Pro 
125 130 I 35 

CGC CTG GGG GGG ATC GGC TTC TCC AGC TGC ATA GTC TGT CTC TTT GTG 
Arg Leu Gly Gly He Gly Phe Ser Ser Cys He Val Cys Leu Phe Val 
140 145 150 

GGG CTG TAT TAT AAT GTG ATC ATC GGG TGG AGC ATC TTC TAT TTC TTC 
Gly Leu Tyr Tyr Asn Val He He Gly Trp Ser He Phe Tyr Phe Phe 
155 160 165 - 

AAG TCC TTC CAG TAC CCG CTG CCC TGG AGT GAA TGT CCT GTC GTC AGG 
Lys Ser Phe Gin Tyr Pro Leu Pro Trp Ser Glu Cys Pro Val Val Arg 
170 175 180 185 

AAT GGG AGC GTC GCA GTG GTG GAG GCA GAG TGT GAA AAG AGC TCA GCC 
Asn Gly Ser Val Ala Val Val Glu Ala Glu Cys Glu Lys Ser Ser Ala 
190 195 200 

ACT ACC TAC TTC TGG TAC CGA GAG GCT TTG GAC ATC TCT GAC TCC ATC 
Thr Thr Tyr Phe Trp Tyr Arg Glu Ala Leu Asp He Ser Asp Ser He 
205 210 - 215 

TCG GAG AGT GGG GGC CTC AAC TGG AAG ATG ACC CTG TGC CTC CTC GTG 
Ser Glu Ser Gly Gly Leu Asn Trp Lys Met Thr Leu Cys Leu Leu Val. 
220 225 230 



484 



532 



580 



628 



676 



724 



772 



820 



868 



916 



964 
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GTC TGG AGC ATC GGG GGG ATG GCT GTC GGT AAG GGC ATC CAG TCC TCG 1012 
Val Trp Ser He Gly Gly Met Ala Val Gly Lys Gly He Gin Ser Ser 
235 240 245 

GGG AAG GTG ATG TAT TTC AGC TCC CTC TTC CCC TAC GTG GTG CTG GCC 1060 
Glv Lys Val Met Tyr Phe Ser Ser Leu Phe Pro Tyr Val Val Leu Ala 
250 255 260 265 

TGC TTC CTG GTC CGG GGG TTG TTG TTG CGA GGG GCA GTT GAT GGC ATC 1108 
Cvs Phe Leu Val Arg Gly Leu Leu Leu Arg Gly Ala Val Asp Gly He 
270 275 280 

CTA CAC ATG TTC ACT CCC AAG CTG GTC AAG ATG CTG GAC CCC CAG GTG 1156 
Leu His Met Phe Thr Pro Lys Leu Val Lys Met Leu Asp Pro Gin Val 
285. 290 295 

TGG CGG GAG GTA GCT ACC CAG GTC TTC TTT GGC TTG GGT CTG GGC TTT 12 04 

Trp Arg Glu Val Ala Thr Gin Val Phe Phe Gly Leu Gly Leu Gly Phe 
300 . 305 310 

GGT GGT GTC ATT GTC TTC TCC AGT TAC AAT AAG CAG GAC AAC AAC TGC 1252 
Gly Gly Val He Val Phe Ser Ser Tyr Asn Lys Gin Asp Asn Asn Cys 
315 320 325 

CAC TTC GAT GGC GCC CTG GTG TCC TTC ATC AAC TTC TTC ACG TCA GTG 13 00 

His Phe Asp Gly Ala Leu Val Ser Phe He Asn Phe Phe Thr Ser Val 
IU 330 335 340 345 

S.J 

„jp TTG GCC ACC CTC GTG GTG TTT GTT GTT TTG GGC TTC AAG GCC AAC ATC 1348 

»fc Leu Ala Thr Leu Val Val Phe Val Val Leu Gly Phe Lys Ala Asn He 

i= 350 355 360 

W ATG AAT GAG AAG TGT GTG GTC GAG AAT GCT GAG AAA ATC CTA GGG TAC 13 96 

Met Asn Glu Lys Cys Val Val Glu Asn Ala Glu Lys He Leu Gly Tyr - 

fiU 365 370 375 

5 CTT AAC ACC AAC GTC CTG AGC CGG GAC CTC ATC CCA CCC CAC GTC AAC 1444 

(3 Leu Asn Thr Asn Val Leu Ser Arg Asp Leu He Pro Pro His Val Asn 

il 380 385 390 

TTC TCC CAC CTG ACC ACA AAG GAC TAC ATG GAG ATG GAC AAT GTC ATC 1492 
Phe Ser His Leu Thr Thr Lys Asp Tyr Met Glu Met Asp Asn Val He 
395 400 405 

ATG ACC GTG AAG GAG GAC CAG TTC TCA GCC CTG GGC CTT GAC CCC TGC 1540 
Met Thr Val Lys Glu Asp Gin Phe Ser Ala Leu Gly Leu Asp Pro Cys 
410 415 420 425 

CTT CTG GAG GAC GAG CTG GAC AAG TCC GTG CAG GGC ACA GGC CTG GCC 1588 
Leu Leu Glu Asp Glu Leu Asp Lys Ser Val Gin Gly Thr Gly Leu Ala 
430 435 440 

TTC ATC GCC TTC ACT GAG GCC ATG ACG CAC TTC CCC ACC TCC CCG TTC 1636 
Phe He Ala Phe Thr Glu Ala Met Thr His Phe Pro Thr Ser Pro Phe 
445 450 455 

TGG TCC GTC ATG TTC TTC TTG ATG CTT ATC AAC CTG GGC CTG GGC AGC 1684 
Trp Ser Val Met Phe Phe Leu Met Leu He Asn Leu Gly Leu Gly Ser 
460 465 470 



ATG ATC GGG ACC ATG GCA GGC ATC ACC ACG CCC ATC ATC GAC ACC TCC 
Met He Gly Thr Met Ala Gly He Thr Thr Pro He He Asp Thr Ser 



1732 
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475 480 485 



AAG GTG CCC AAG GAG ATG TTC ACA GTG GGC TGC TGT GTC TTT ACA TTC 1780 
Lvs Val Pro Lys Glu Met Phe Thr Val Gly Cys Cys Val Phe Thr Phe 
490 495 500 505 

CTC GTG GGA CTG TTG TTC GTC CAG CGC TCC GGA AAC TAC TTT GTC ACC 1828 
Leu Val Gly Leu Leu Phe Val Gin Arg Ser Gly Asn Tyr Phe Val Thr 
510 515 520 

ATG TTC GAT GAC TAC TCA GCC ACG CTG CCA CTC ACT CTC ATC GTC ATC 1876 
Met Phe Asp Asp Tyr Ser Ala Thr Leu Pro Leu Thr Leu He Val He 
525 530 535 

CTT GAG AAC ATC GCT GTG GCC TGG ATT TAT GGA CCC AAG AAG TTC ATG 1924 
Leu Glu Asn He Ala Val Ala Trp He Tyr Gly Pro Lys Lys Phe Met 
540 545 550 

CAG GAG CTG ACG GAG ATG CTG GGC TTC CGC CCC TAC CGC TTC TAT TTC 1972 
Gin Glu Leu Thr Glu Met Leu Gly Phe Arg Pro Tyr Arg Phe Tyr Phe 
555 560 565 

TAC ATG TGG AAG TTC GTG TCT CCA CTA TGC ATG GCT GTG CTC ACC ACA 2 020 

r=i Tvr Met Trp Lys Phe Val Ser Pro Leu Cys Met Ala Val Leu Thr Thr - 

J 570 575 580 585 

S GCC AGC ATC ATC CAG CTG GGG GTC ACG CCC CCG GCC TAC AGC GCC TGG 2068 

8y Ala Ser He He Gin Leu Gly Val Thr Pro Pro Ala Tyr Ser Ala Trp 
W 590 595 600 

>'fc 

■P ATC AAG GAG GAG GCT GCC GAG CGC TAC CTG TAT TTC CCC AAC TGG CCC 2116 

»C He Lys Glu Glu Ala Ala Glu Arg Tyr Leu Tyr Phe Pro Asn Trp Pro 

605 610 615 

5 ATG GCA CTC CTG ATC ACC CTC ATC GTC GTG GCG ACG CTG CCC ATC CCT 2164 

« Met Ala Leu Leu He Thr Leu He Val Val Ala Thr Leu Pro He Pro 

U 620 625 630 

pj 

O GTG GTG TTC GTC CTG CGG CAC TTC CAC CTG CTC TCT GAT GGC TCC AAC 2212 

M Val Val Phe Val Leu Arg His Phe His Leu Leu Ser Asp Gly Ser Asn 
635 640 645 

ACC CTC TCC GTG TCC TAC AAG AAG GCC CGC ATG ATG AAG GAC ATC TCC 22 60 

Thr Leu Ser Val Ser Tyr Lys Lys Ala Arg Met Met Lys Asp He Ser 
650 655 660 665 

AAC CTG GAG GAG AAC GAT GAG ACC CGC TTC ATC CTC AGC AAG GTG CCC 2308 
Asn Leu Glu Glu Asn Asp Glu Thr Arg Phe He Leu Ser Lys Val Pro 
670 675 680 

AGT GAG GCA CCT TCC CCC ATG CCC ACT CAC CGT TCC TAT CTG GGG CCC 2356 
Ser Glu Ala Pro Ser Pro Met Pro Thr His Arg Ser Tyr Leu Gly Pro 
685 690 695 

GGC AGC ACA TCA CCC CTG GAG ACC AGC TGG AAC CCC AAT GGA CCC TAT 2404 
Gly Ser Thr Ser Pro Leu Glu Thr Ser Trp Asn Pro Asn Gly Pro Tyr 
700 705 . 710 

GGG CGC GGC TAC CTG CTG GCC AGC ACC CCT GAG TCT GAG CTG 2446 
Gly Arg Gly Tyr Leu Leu Ala Ser Thr Pro Glu Ser Glu Leu 
715 720 725 

TGACCACTGC CCAAGCCCAT GCCCGCTCTC CCCCCACCG 2485 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 727 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met 



Pro Lys Asn Ser Lys Val Thr Gin Arg Glu His Ser Ser Glu His 



10 



Val Thr Glu Ser Val Ala Asp Leu Leu Ala Leu Glu Glu Pro Val Asp 
20 25 30 

Tyr Lys Gin Ser Val Leu Asn Val Ala Gly Glu Ala Gly Gly Lys Gin 
35 40 45 

Lys Ala Val Glu Glu Glu Leu Asp Ala Glu Asp Arg Pro Ala Trp Asn 
50 55 60 ■ - 

Ser Lys Leu Gin Tyr He Leu Ala Gin lie Gly. Phe Ser Val Gly Leu 

65 70 75 

Gly Asn He Trp Arg Phe Pro Tyr Leu Cys Gin Lys Asn Gly Gly Gly 
85 90 

Ala Tyr Leu Val Pro Tyr Leu Val Leu Leu He He He Gly He Pro 
100 105 

Leu Phe Phe Leu Glu Leu Ala Val Gly Gin Arg He Arg Arg Gly Ser 
115 120 125 



I 



le Gly Val Trp His Tyr He Cys Pro Arg Leu Gly Gly He Gly Phe 



135 140 



130 

Ser Ser Cys He Val Cys Leu Phe Val Gly Leu Tyr Tyr Asn Val He 
145 150 155 

He Gly Trp Ser lie Phe Tyr Phe Phe Lys Ser Phe Gin Tyr Pro Leu 

Pro Trp Ser Glu Cys Pro Val Val Arg Asn Gly Ser Val Ala Val Val 
180 185 I 90 

Glu Ala Glu Cys Glu Lys Ser Ser Ala Thr Thr Tyr Phe Trp Tyr Arg 
195 200 205 

Glu Ala Leu Asp He Ser Asp Ser He Ser Glu Ser Gly Gly Leu Asn 
210 215 220 

Trp Lys Met Thr Leu Cys Leu Leu Val Val Trp Ser He Gly Gly Met 
225 230 235 

Ala Val Gly Lys Gly He Gin Ser Ser Gly Lys Val Met Tyr Phe Ser 
245 250 25b 

Ser Leu Phe Pro Tyr Val Val Leu Ala Cys Phe Leu Val Arg Gly Leu 
260 265 270 
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Leu Leu Arg Gly Ala Val Asp Gly He Leu His Met Phe Thr Pro Lys 
275 28 9 285 

Leu Val Lys Met Leu Asp Pro Gin Val Trp Arg Glu Val Ala Thr Gin 
290 295 300 

Val Phe Phe Gly Leu Gly Leu Gly Phe Gly Gly Val He Val Phe Ser 
305 310 315 ^0 

Ser Tyr Asn Lys Gin Asp Asn Asn Cys His Phe Asp Gly Ala Leu Val 
325 330 335 

Ser Phe He Asn Phe Phe Thr Ser Val Leu Ala Thr Leu Val Val Phe 
340 345 J => u 

Val Leu Gly Phe Lys Ala Asn He Met Asn, Glu Lys Cys Val Val 



Val 

355 



360 365 



Glu Asn Ala Glu Lys He Leu Gly Tyr Leu Asn Thr Asn Val Leu Ser 
370 375 380 

Arg Asp Leu He Pro Pro His Val Asn Phe Ser His Leu Thr Thr Lys 
385 390 395 

Asp Tyr Met Glu Met Asp Asn Val He Met Thr Val Lys Glu Asp Gin 

Phe Ser Ala Leu Gly Leu Asp Pro Cys Leu Leu Glu Asp Glu Leu Asp 
42 0 425 

Lys Ser Val Gin Gly Thr Gly Leu Ala Phe He Ala Phe Thr Glu Ala 
435 440 445 

Met Thr His Phe Pro Thr Ser Pro. Phe Trp Ser Val Met Phe Phe Leu 
450 4 55 460 



Met- Leu 
465 



lie Asn Leu Gly Leu Gly Ser Met He Gly Thr Met Ala Gly 
470 4 "75 480 

lie Thr Thr Pro He He Asp Thr Ser Lys Val Pro Lys Glu Met Phe 
485 4 9° 495 

Thr Val Gly Cys Cys Val Phe Thr Phe Leu Val Gly Leu Leu Phe Val 



500 



505 



Gin Arg Ser Gly Asn Tyr Phe Val Thr Met Phe Asp Asp Tyr Ser Ala 
515 520 525 

Thr Leu Pro Leu Thr Leu He Val He Leu Glu Asn He Ala Val Ala 
530 535 540 

Trp He Tyr Gly Pro Lys Lys Phe Met Gin Glu Leu Thr Glu Met Leu 
545 550 555 

Gly Phe Arg Pro Tyr Arg Phe Tyr Phe Tyr Met Trp Lys Phe Val Ser 
* 570 =75 



565 



Pro Leu Cys Met Ala Val Leu Thr Thr Ala Ser He He Gin Leu Gly 
580 585 590 

Val Thr Pro Pro Ala Tyr Ser Ala Trp He Lys Glu Glu Ala Ala Glu 
595 600 605 
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Arg Tyr Leu Tyr Phe Pro Asn Trp Pro Met Ala Leu Leu lie Thr Leu 
610 615 620 

lie Val Val Ala Thr Leu Pro lie Pro Val Val Phe Val Leu Arg His 
625 630 635 640 

Phe His Leu Leu Ser Asp Gly Ser Asn Thr Leu Ser Val Ser Tyr Lys 
645 650 655 

Lys Ala Arg Met Met Lys Asp lie Ser Asn Leu Glu Glu Asn Asp Glu 
660 665 670 

Thr Arg Phe He Leu Ser Lys Val Pro Ser Glu Ala Pro Ser Pro Met 
675 680 685 

Pro Thr His Arg Ser Tyr Leu Gly Pro Gly Ser Thr Ser Pro Leu Glu 
690 695 700 

Thr Ser Trp Asn Pro Asn Gly Pro Tyr Gly Arg Gly Tyr Leu Leu Ala 
705 710 715 720 

Ser Thr Pro Glu Ser Glu Leu 
725 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 
(AT LENGTH: 30 base, pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 
GACTAAAGCT TGGCATCAAT GCCGAAGAAC 
(2) INFORMATION FOR SEQ ID NO : 4 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GAACTTCTAG AGCAGTGGTC ACAGCTCAG 

( 2 ) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 3 8 base pairs 



(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 
GACTAAGATC TGCCACCATG CCGAAGAACA GCAAAGTG 
( 2). INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
GAACTGATAT CGCAGTGGTC ACAGCTCAG 



